DEDUCING THE MULTIDIMENSIONAL 
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Tim Austin 
Abstract 

We offer a new proof of the Furstenberg-Katznelson multiple recurrence 
theorem for several commuting probability-preserving transformations T%, 
T2, . . . ,Td : Z rv (X, S, fi) ([6]), and so, via the Furstenberg correspon- 
dence principle introduced in Q, a new proof of the multi-dimensional Sze- 
meredi Theorem. We bypass the careful manipulation of certain towers of 
factors of a probability-preserving system that underlies the Furstenberg- 
Katznelson analysis, instead modifying an approach recently developed in HI 
to pass to a large extension of our original system in which this analysis 
greatly simplifies. The proof is then completed using an adaptation of ar- 
guments developed by Tao in ['13 ] for his study of an infinitary analog of 
the hypergraph removal lemma. In a sense, this addresses the difficulty, 
highlighted by Tao, of establishing a direct connection between his infini- 
tary, probabilistic approach to the hypergraph removal lemma and the infini- 
tary, ergodic-theoretic approach to Szemeredi's Theorem set in motion by 
Furstenberg ll5l . 
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1 Introduction 



We give a new ergodic-theoretic proof of the multidimensional multiple recur- 
rence theorem of Furstenberg and Katznelson Q, which their correspondence 
principle shows to be equivalent to the multidimensional Szemeredi Theorem. 

THEOREM 1.1 (Multidimensional multiple recurrence). Suppose thatTi, T 2 , . . . , Td : 
Z rx (X, S, jj) are commuting probability-preserving actions and that AgE has 
/i(A) > 0. Then 

1 - 

l T inf m E/^O 4 ) n T 2 -(A) n ■ ■ ■ n t^(A)) > o, 

N—>oo iv ^ ' 
n=l 

an J so, in particular, there is some n > 1 wiYn 

/x(Tf n (A) n T 2 - n (A) n • • • n r d ~ n (A)) > o. 



Our proof of Theorem ll.il will call on some rather different ergodic-theoretic ma- 
chinery from Furstenberg and Katznelson's. Our main technical ingredients are 
the notions of 'pleasant' and 'isotropized' extensions of a system. Pleasant exten- 
sions were first used in HI to give a new proof of the (rather easier) result that the 
'nonconventional ergodic averages' 

1 N 

n=l i<d 

associated to fx, f 2 , . . . , fd & L°°(fi) always converge in L 2 (fi) as N — >• oo. (This 
was first shown by Tao in [fT4l . although various special cases had previously been 
established by other methods [J2l [3l |T5l [TUl [TTL [T6l SJ .) Much of the present paper 
is motivated by the results used in HI to give a new proof of this convergence. 
Isotropized extensions are a new tool developed for the present paper, but their 
analysis is closely analogous to that of pleasant extensions. 
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After passing to a pleasant and isotropized extension of our original system, the 
limit of CQ) takes a special form, and in this paper it is by analyzing this expression 
that we shall prove positivity. It turns out that this special form enables us to 
make contact with the machinery developed by Tao in lfT3l for his infinitary proof 
of the hypergraph removal lemma. Since the hypergraph removal lemma offers 
a known route to proving the multidimensional Szemeredi Theorem (as shown, 
subject to some important technical differences, by Nagle, Rodl and Schacht lfT2ll 
and by Gowers [8]), and this in turn is equivalent to multidimensional multiple 
recurrence, Tao's work already offers a proof of multiple recurrence using his 
infinitary removal lemma. In a sense, our present contribution is to short-circuit 
the above chain of implications and give a near-direct proof of multiple recurrence 
using Tao's ideas. Unfortunately, we have not been able to make a reduction to a 
simple black-box appeal to Tao's result; rather, we formulate (Proposition 16. II) a 
closely-related result adapted to our ergodic theoretic setting, which then admits 
a very similar proof. With this caveat, our work addresses the question of relating 
infinitary proofs of multiple recurrence and hypergraph removal explicitly raised 
by Tao at the beginning of Section 5 of [fT3l : it turns out that his ideas are not 
directly applicable to an arbitrary probability-preserving Z rf -system, but becomes 
so only when we enlarge the system to lie in the special class of systems that are 
pleasant and isotropized. 

Acknowledgements My thanks go to Vitaly Bergelson and David Fremlin for 
helpful comments on an earlier version of this paper, and to the Mathematical 
Sciences Research Institute where a significant re-write was undertaken after a 
serious flaw was discovered in an earlier version. 



2 Basic notation and preliminaries 

Throughout this paper (X, E) will denote a measurable space. Since our main re- 
sults pertain only to the joint distribution of countably many bounded real- valued 
functions on this space and their shifts under some measurable transformation, by 
passing to the image measure on a suitable shift space we may always assume that 
(X, S) is standard Borel, and this will prove convenient for some of our later con- 
structions. In addition, /i will always denote a probability measure on S. We shall 
write (X e , £® e ) for the usual product measurable structure indexed by a set e, and 
/z® e for the product measure and fi Ae for the diagonal measure on this structure 
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respectively. We also write 7Tj : X e — ■> X for the i coordinate projection when- 
ever j 6 e. Given a measurable map : (X, E) — > (Y, <3>) to another measurable 
space, we shall write o ^ for the resulting pushforward probability measure on 

If T : T rx (X, S,/i) is a probability -preserving action of a countable group 
r, then by a factor of the quadruple (X, E, /i, T) we understand a globally T- 
invariant sub-cr-algebra $ < E. The isotropy factor is the sub-a-algebra of those 
subsets AgE such that /j,(AAT~* (A)) = for all 7 G T, and we shall denote it 
by E T . If Ti, T 2 : T rx (X, /i) axe two commuting actions of the same Abelian 
group, then we can define another action by (T 1 _1 T 2 ) 7 := T/ T^, and then we 
write E Tl=T2 for E T i 72 , and similarly if we are given a larger number of actions 
of the same group. The most important kind of morphism from one T-system 
T : T rx (X, E, /i) to another S : T rx (Y, $, z/) is given by a measurable map 
: X — > Y such that u — (f) o fj, and S 1 o = o T: we call such a a factor 
map. In this case we shall write : (X, E, /i, T) — > (Y, $, z/, 5). To a factor map 
we can associate the factor {0~ 1 (A) : A G $}. 

Our specific interest is in rf-tuples of commuting Z-actions Ti, i = 1,2, ... ,d. 
Clearly these can be interpreted as the Z-subactions of a single Z d -action corre- 
sponding to the d coordinate directions Z • < Z d . 

Given these actions, we shall make repeated reference to certain factors assembled 
from the isotropy factors among the Tj. These will be indexed by subsets of 
[d] := {1, 2, ... , d}, or more generally by subfamilies of the collection (^) of 
all subsets of [d] of size at least 2. On the whole, these indexing subfamilies will 
be up-sets in (^): J C (^ 2 ) such that u G X and [d] D v D u imply v G X. 

For example, given e C [d] we write (e) := {u G (^) : u D e} (note the 
non-standard feature of our notation that e G (e) if and only if |e| > 2): up-sets of 
this form are principal. We will abbreviate ({i}) to (i). It will also be helpful to 
define the depth of a non-empty up-set X to be min{ |e| : eel}. 

The corresponding factors are obtained for e = i 2 , ■ ■ ■ , 4} ^ [d] with k > 2 
by defining $ e := E Tl i =Ti 2 = - =Ti fc, and given an up-set X C by defining 

From the ordering among the factors $ e it is clear that $1 = $^4 whenever A C 
(^) is a family that generates X as an up-set, and in particular that $ e = $( e ). 

An inverse system is a family of probability-preserving systems : T rx 
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(XW, E {m \^ m) ) together with factor maps 




from this one can construct the inverse limit 



lim (X (m) ,S (m) ;y u (m) ,T (m) ) 



as described, for example, in Section 6.3 of Glasner 0. 

Finally, the following distributional condition for families of factors will play a 
central role through this paper. 

DEFINITION 2.1 (Relative independence for factor-tuples). 7f > are fac- 
tors of (X, E, /i) for each i < d, then the tuple of factors (Si, £ 2 , • • • , w 
relatively independent over the tuple (Hi, E 2 , . . . , if whenever fi e L°°(/i|~ s J 
/or eac/z 2 < d we /zave 



3 The Furstenberg self -joining 

It turns out that a particular rf-fold self -joining of fi both controls the convergence 
of the nonconventional averages CQ) and then serves to express their limiting value. 

Given our commuting actions and any e = {i x < i 2 < . . . < 4} Q [d], we define 
fil(Ai x A 2 x • • • x A fc ) 



for Ai, A 2 , . . . , A% G S. That these limits always exist (and so this definition 
is possible) follows from the convergence of the nonconventional averages CD, 
although approaches to convergence that use this self -joining (as in H), or for var- 
ious special cases in [fT51 and IfToTl ) actually handle both kinds of limits alternately 
in a combined proof of their existence by induction on k. 




J im 4 E MV^i) n Tr^A 2 ) n • ■ ■ n ^]) 



n=l 
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Given the existence of the limits (OQ) and the assumption that (X, S) is standard 
Borel, it is easy to check that ^ extends to a /c-fold self -joining of fi on E® e . This 
is the Furstenberg self -joining of /i associated to T^, T, 2 , . . . , T ik . It is now clear 
from our definition that the assertion of Theorem ll.ll can be re-stated as being that 
if > then also fj^ d ^A d ) > 0. It is in this form that we shall prove it. 

The following elementary properties of the Furstenberg self-joining will be im- 
portant later. 

LEMMA 3.1. Ife = {ii<i 2 <...< ik] Q e' = {ji < j 2 < ■ ■ ■ < ji} then 

7T{ii,i 2 ,...,i fe } ° t4> = ^fi- 
PrOOf This is immediate from the definition: if A* . G £ for each j < k then 

{K{h,i 2 ,~,ik} ° t£>){ A l X A 2 x • • ■ x A k ) 



1 - 

& at E M 2 * 1 W n W n • ■ ■ n W) 



n=l 

where := Aj if j G e and Sj := X otherwise; but then this last average 
simplifies summand-by-summand directly to 

1 - 

& n E M^TM n T r(^) n • ■ ■ n V(40) 

n=l 

= : l4(Ai x A 2 x ■■■ x Ak), 
as required. □ 
LEMMA 3.2. For anv e C [d] ?/ze restriction fj^ \$®e is just the diagonal measure 

(/'V) Af - 

Proof If e = {ii < i 2 < • • • < h} an d Aj G $ e for each j < k then by 
definition we have 

1 - 

f/ e (Ai><A 2 x.--xA k ) = Hm -^^ n (^i) n W)n-n^"(4)) 

^ n=l 



& ^ E a*CC(^i n A 2 n • • • n A*)) 

n=l 

/i(AinA 2 n---n4), 



6 



as required. □ 

It follows from the last lemma that whenever e C e! the factors 7r i _1 ($ e ) < E® e ' 
for i G e are all equal up to fi^, -negligible sets. It will prove helpful later to have 
a dedicated notation for these factors. 

DEFINITION 3.3 (Oblique copies). For each e C [d] we refer to the common 
[^-completion of the sub-a-algebra ^l l {^ e ), i G e, as the oblique copy of$ e , 
and denote it by $f . More generally we shall refer to factors formed by repeatedly 
applying n and V to such oblique copies as oblique factors. 



It will be important to know that Furstenberg self -joinings behave well under in- 
verse limits. The following is another immediate consequence of the definition, 
and we omit the proof. 

LEMMA 3.4. // 

— y (X( m+1 ), E^ m+1 \ yS m+1 \ T ( ^ m+1 - ) ) ^ m > (X^ m \ E^, T*™)) — > 

is an inverse system with inverse limit (X,E,/i, T), then the Furstenberg self- 
joinings ({X^)\ (E( m ))® d , (fx^)f d] ,T xd ) with factor maps 0* d also form an 

inverse system with inverse limit (X d , E® d , jlj* , T xd ) . □ 



4 Pleasant and isotropized extensions 

We now introduce the main technical definitions of this paper: that of 'pleas- 
ant systems', closely following [1], and alongside them the related notion of 
'isotropized systems'. Recall that to a commuting tuple of actions Ti, T 2 , . . . , : 

Z rx (X, E, /i) we associate the factors 

indexed by subsets e = i 2 , . . . , ik] ^ [d]. 

DEFINITION 4.1 (Pleasant system). A system (X, E, /i, T) is (e, i)-pleasant for 

some i G e G (>|) z/f/ze z th coordinate projection 7Tj is relatively independent from 
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the other iTj, j G e, over the factor -K i ( Vye e \{i} un der the Furstenberg 

self-joining fi^: 




whenever fj G L°° (/i) for each j G e. 

It is fully pleasant if it is (e, i) -pleasant for every pair i G e. 

DEFINITION 4.2 (Isotropized system). A commuting tuple of actions Ti, T 2 , . . . , : 

Z r> (X, S, /i) is (e, i)-isotropized for some i G e G (J^) z/ 

$ e n ( V = V $ eum 

j'e[d]\e j6[d]\e 

up to ^-negligible sets. 

It is fully isotropized if it is (e, i) -isotropized for every (e, i). 

Intuitively, both pleasantness and isotropizedness (say when e = [d]) assert that 
the factors are 'large enough' : in the first case, large enough to account for all 
of the possible correlations between the coordinate projections under the Fursten- 
berg self-joining, and in the second to account for all of the possible intersection 
between $ e and the combination Vj e u]\ e U P to negligible sets. This notion 
of pleasantness is very similar to Definition 4.2 in [Q]|, where 'pleasant systems' 
were first introduced as those in which the larger factors S Tj V were 'char- 
acteristic' for the asymptotic behaviour of the nonconventional averages (OQ) in 
L 2 (/i). Here our emphasis is rather different, since we are concerned only with 
the integrals of these ergodic averages, rather than the functions themselves. For 
these integrals it turns out that we can discard the factors S Ti from consideration. 
This lightens some of the notation that follows, but otherwise makes very little 
difference to the work we must go through. 

Notice that the subset e C [d] is allowed to vary in both of the above definitions: 
this nuance is important, since the pleasantness property relating a proper subfam- 
ily of actions T i? i G e, is in general not a consequence of the pleasantness of the 
whole family, and similarly for isotropizedness. 

The main goal of this section is the following proposition. 
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PROPOSITION 4.3 (Simultaneously pleasant and isotropized extensions). Any 

commuting tuple of actions T\, T 2 , . . . , T d : Z rx (X, E, /x) admits an extension 
that is both fully pleasant and fully isotropized. 

This will rely on a number of simpler steps, many closely following the arguments 
of [0Q. We first show that any tuple of actions admits an (e, i)-pleasant extension 
and, separately, an (e, i) -isotropized extension. 

The first of these results is proved exactly as was Proposition 4.6 in £1], and so 
we shall only sketch the proof here. The idea behind the proof is to construct of 
a tower of extensions, each accounting for the shortfall from pleasantness of its 
predecessor, and then the pass to the inverse limit. 

LEMMA 4.4 (Existence of an (e, i) -pleasant extension). Any commuting tuple 
of actions T 1; T 2 , ... ,T d : Z rx (X, £,//) admits an (e,i) -pleasant extension 
(X,E,/2,f). 

Proof We form (X ,T:, fi,T) as the inverse limit of a tower of smaller exten- 
sions, each constructed from the Furstenberg self-joining of its predecessor. Let 
(X (1) ,EW,/iW) be the Furstenberg self-joining (X e ,£® e ,/^) and define on it 
the transformations 

and 

f fc :=(T fc ) xe forfc^i, 

and interpret it as an extension of (X, E, //, T) with the coordinate projection m 
as factor map. We now see that if fj E L°°(fx) for each j E e then 




and from the above definition that the factor of X e = X^ generated by (ttj) j<= e \{i} 
is contained in Vjee\{i} jV ^ we now i terate this construction to form (X( 2 \ S^ 2 \ T^) 
from (X^ 1 ', E^jju' 1 ', T^), and so on, then the approximation argument given for 
Proposition 4.6 of [HI shows that the inverse limit is (e, i)-pleasant. □ 

Remark Since the appearance of 0] , Bernard Host has given in [9J a method for 
constructing a pleasant extension of a system without recourse to an inverse limit. 
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However, we will make further use of inverse limits momentarily to construct 
an extension that is fully pleasant, rather than just (e, i)-pleasant for some fixed 
(e, i), and at present we do not know of any quicker construction guaranteeing this 
stronger condition. < 

A similar argument gives the existence of (e, i)-isotropized extensions. 

LEMMA 4.5 (Existence of (e, i)-isotropized extension). Any commuting tuple 
of actions T 1; T 2 , . . . ,T d : Z rx (X, E, ji) admits an (e, i)-isotropized extension 
{X, E, jl, f). 

Proof Once again we build this as an inverse limit. First form the relatively 
independent self-product (X^\ fi^) := (X 2 , E ®$ e E, /j, ®$ e p) with coordinate 
projections 7Ti, tx 2 back onto (X, E, /i), and interpret it as an extension of (X, E, p) 
through the first of these. Choose arbitrarily some i e e, and now define the 
extended actions T- 1 ^ on X* 1 ) by setting 

T (D . = J Tj x Tj if j # e, 
3 ' \ Tj x Tj if j e e; 

these all preserve fi^\ even in the latter case, because our product is relatively 
independent over the factor left invariant by each T~ x Ti for j e e. 

We now extend (X (1 \ E«, T«) to (A"( 2 >, £< 2 >, ji( 2 ),T( 2 >) by repeating the 
same construction, and so on, to form an inverse series with inverse limit (X, E, jl, T) . 

We will show that this has the desired property. Any / e -^°°(/4<i>en(V je[d]Xe j>)) 
may, in particular, be approximated in L 1 (//) by finite sums of the form ^ p rije[d]\e 0p,j 
with JiP G However, since //W is joined relatively independently 

conditioned on $ e and / is also $ e -measurable, it follows that / o m = f o n 2 
/i^-almost surely, and so in the extended system (X^\ Tp-\pP^) we can alter- 
natively approximate / o m by the functions J2 P Ylje[d\\e $v,i ° n 2> and now every 
0p,j ° ^2 is both manifestly ^j^-measurable, since both Tj and Tj are simply 
lifted to T* 2 and T* 2 , and manifestly -measurable, since all the transforma- 
tions T^ defined above for j e e agree on the second coordinate factor n 2 1 (H). 
Therefore / o m may be approximated arbitrarily well in L 1 (/i) by functions that 

are measurable with respect to Vje[d]\ e ^iufjV ^ ow anomer simple approxima- 
tion argument and the martingale convergence theorem show that the inverse limit 
system (X, E, jl, T) is actually (e, i)-isotropized, as required. □ 
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We will finish the proof of Proposition 14.3 1 using the following properties of sta- 
bility under forming further inverse limits. 

LEMMA 4.6 (Pleasantness of inverse limits). If 

w an inverse system with inverse limit (X, E, //, T) and ieeC [<i], 

• Z /(X( m ), S^ m ), j/M, T( m )) is (e, i) -pleasant for infinitely many m, then (X, E, /}, 
zs a/50 (e, i)-pleasant; 

• if [X^ m \Yl^ m \ ii^ m \T^ m ') is (e,i)-isotropized for infinitely many m, then 
(X, E, ft, T) is also (e, i)-isotropized. 

Proof We give the proof for the retention of (e, i)-pleasantness, the case of (e, i)- 
isotropizedness being exactly analogous. 

Since any 1 -bounded member of L°°(jl) may be approximated arbitrarily well in 
L 1 (/2) by 1-bounded members of L°°(yu( m )), by a simple approximation argument 
it will suffice to prove that given m > 1 and fj G L°°(/i( m )) for each j E e we 
have 

/ n fj ° ^ = E ^(fi\ v ^.j}) ° ^ ■ n & ° ^ 

However, by definition and Lemma I3~4l we know that after choosing any m\> m 
for which (X^ mi \ E (mi) , /i (mi) , T (mi) ) is (e, i)-pleasant the above is obtained with 

Vjee\{i} in P lace of Vjee\{i} and now letting m x -> 00 and appealing 

to the bounded martingale convergence theorem gives the result. □ 

It now remains only to collect our different properties together using more inverse 
limits, whose organization is now rather arbitrary. 

Proof of Proposition 14.31 Pick a sequence of pairs ((e m , i m )) m >i from the finite 
set {(e, i) : |e| > 2, i 6 e C [d]} in which each possible (e, i) appears infinitely 
often. Now one last time form a tower of extensions 

— > (X < - m+1 - 1 , vj( m + 1 ) ; yU ( m + 1 ) ; 2 1 ( m + 1 )) _» (X*-™- 1 , H^ m \ fi^ m \ T^) —>■... 

above (X, E, //, T) in which {X^ m \ ^M, T^) is (e (m+1)/2 , z (m+1)/2 )-pleasant 
when m is odd and (e m / 2 , z m / 2 )-isotropized when m is even. The two parts of 
Lemma |4~6l now show that the resulting inverse limit extension has all the desired 
properties. □ 
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5 Furstenberg self -joinings of pleasant and isotropized 
systems 



Having established that all systems have fully pleasant and isotropized extensions, 
it remains to explain the usefulness of such extensions for the proof of Theo- 
rem 11.11 This derives from the implications of these conditions for the structure 
of the Furstenberg self-joining. 

LEMMA 5.1. If the tuple T 1; T 2 , . . . , T d : Z r% (X, S, fi) is fully pleasant and 
fully isotropized, X C Q^j is an up-set and e is a member of (^) \ X of maximal 
size then the oblique copy $^ and the oblique factor $f are relatively independent 
over $ ln< e > under Mfd]- 

Proof Suppose that Fx G L 00 ^ t*f) and F 2 E L 00 ^ \**). It will suffice to 
show that 

/ FiF 2 dfif d] = [ E /t (Fi | $| n(e> ) • F 2 dfif d] . 
Jx d Jx d 

Pick i £ e. By Lemma [3T21 there is some fx E L°°(^\^ e ) such that Fx — fx o -K i 
/i^j -almost surely. 

Let {ax, a 2 , . . . , a k } be the antichain of minimal elements in X; this clearly gener- 
ates X as an up-set. Since e G" X we must have a,j \ e ^ for each j < A;. Pick 
i,- G aj\e arbitrarily for each j < k, so that, again byLemmaO = n^i® aj ) 
(up to yu^j -negligible sets). 

Now, since $f = Vj<fc ®aj> ^2 ma y t> e approximated arbitrarily well in L 1 (/x^) 

by sums of products of the form J2 P Ylj<k ^hv ° n i 3 w ^ m §j,v e •^ ,0Q (A t t*o-)» and 
so by continuity and linearity it suffices to assume that F 2 is an individual such 
product term. This represents F 2 as a function of coordinates in X d indexed only 
by members of [d] \ e, and now we appeal to Lemma [3T| and the pleasantness of 

^f[d]\e)U{i} t0 deduCe that 

/ F l II fa* ° Md] 

Jx* ?A 

E ^ (/l I V ' II ° ^ d ^fH\e)U{i} • 
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However, now the property that (X, S, /i, T) is (e, i)-isotropized and the fact that 
fx is already $ e -measurable imply that 

E ^(/l| V = E ^(/l| V $eU W 

je[d\\e j&[d]\e 

and since each e U {j} G X (by the maximality of e in V[d]\T), under 7Tj this 
conditional expectation must be identified with E^(F X | $j n ^), as required. □ 

COROLLARY 5.2. If the tuple Ti, T 2 , . . . , Tj : Z r\ (X, S, /i) w fully pleas- 
ant and fully isotropized andX,X' C are up-sets then $j an J are 
relatively independent over $f nJ , under p?L. 

Proof This is proved for fixed X by induction on X'. If X' CI then the result 
is clear, so now let e be a minimal member of X' \ X of maximal size, and let 
X" := T \ {e}. It will suffice to prove that if F G L°°(nf d] |^f ) then 



E |tf (F|^) = E Mr (F|^ 1P ) l 



and furthermore, by approximation, to do so only for F that are of the form Fx ■ F 2 
with Fx G L°° (/i^j ^ ) and F 2 G L°° (/i^ |~ $ f ; ) . However, for these we can write 

E#(F\& X ) = E#(E#(F\^,)\*$) 



"id] v 1 M ' ^[d] v ^[d] 



and by Lemma I5TI 



On the other hand (X U X") fl (e) C X" (because X" contains every subset of [d] 
that strictly includes e, since X' is an up-set), and so Lemma |5TT1 promises similarly 
that 

E, f ^Fi|^ux")n(e)) = E Mfd] (F 1 |$I„). 
Therefore the above expression for E^f (F | $f ) simplifies to 



E <f„ < F ' i ' R - 1 *') = E * < E * < F > ' F * 1 I *') 
= E * ( E * (F i I = E * (F i = E * 

by the inductive hypothesis applied to X" and X, as required. □ 



F 1 
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6 Completion of the proof 



We have now set the stage for our analog of Tao's infinitary hypergraph removal 
machinery. Observe first that the conclusion of Theorem 11.11 clearly holds for the 
commuting tuple T 1; T 2 , . . . , T d : Z rx (X, S, /i) if it holds for any extension of 
that tuple. Therefore by Proposition 14.31 we may assume our commuting tuple 
is fully pleasant and fully isotropized, and so need only prove for such fi that if 
> then fiJ d ^(A d ) > 0. For these particular fi Corollary 15 .21 gives us a very 
precise picture of the joint law under pF* of the poset of oblique factors $f , and 
hence actually of the inverse image factors ir^^j) for X C (i). 

Note that we have not tamed all of the potentially wild structure of the joint dis- 
tribution of the factors $i under p, but only that of the associated oblique factors 
under the Furstenberg self -joining /x^. It seems quite likely that in some cases the 
factors $j of the original system can still exhibit a very complicated joint distri- 
bution, even after passing to a fully pleasant and isotropized extension. However, 
the understanding of the oblique copies is already enough to complete the proof of 
multiple recurrence using a relative of Tao's 'infinitary removal lemma' in |fl3l . 
One of his chief innovations was an infinitary analog of the property of hyper- 
graph removability for a collection of factors of a probability space (Theorem 4.2 
of 11131 ). Here we shall actually make do with a more modest conclusion than his 
'removability', but our argument will follow essentially the same steps. We shall 
derive Theorem 11.11 as the top case of the following inductive claim, tailored to 
our present needs. 

PROPOSITION 6.1. Suppose that for i = 1,2, ... ,d and j = 1,2, ... ,ki 
are collections of up-sets in (J^|) such that [d] G Zjj C (i) for each i,j, and 
suppose further that the sets Aij G ^ j are such that 

"&(n(n^))=°- 

i=i j=i 

Then we must also have 

d k 

KnrW)=°- 

i=l j=l 

The following terminology will be convenient during the proof. 
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DEFINITION 6.2. A family (Z^-)^- /las the property P if it satisfies the conclu- 
sion of the preceding proposition. 

The conclusion of multiple recurrence follows from Proposition ^. H at once: 

Proof of Theorem 11.11 from Proposition 16.11 Suppose that A e E is such that 
/^(A d ) = 0. Then by the pleasantness of the whole system we have 

„ d 

»u Ad ) = j xd n i ° * = o. 

Now the level set Bi := {E^l^ | > 0} (of course, this is unique only up 
to /i-negligible sets) lies in and the above equality certainly implies that also 
yU^I (Bx x B 2 x • • - x B d ) = 0. Now, on the one hand, setting ki = 1, X iy i := (i) and 
A iA := Bi for each % < d, PropositionloTTItells us that n{B x n B 2 H • • • H = 0, 
while on the other we must have fi(A \ Bi) = for each i, and so overall p(A) < 
H{B X n B 2 n • • • H B d ) + £^ =1 /i(A \ = 0, as required. □ 

It remains to prove Proposition 16. II This will be done by induction on a suitable 
ordering of the possible collections of up-sets (Zjj),^, appealing to a handful of 
different possible cases at different steps of the induction. At the outermost level, 
this induction will be organized according to the depth of our up-sets (defined in 
Section 12). 

Let us first illustrate how the above reduction to Proposition 16.11 and then the 
inductive proof of that proposition combine to give a proof of Theorem ll.ll in the 
simple case d = 3. 

Example Suppose that Ti, T 2 , T 3 : Z rx (X, E, p) is a fully pleasant and fully 
isotropized triple of actions and that AeS has /ij^A 3 ) = 0. We will show that 
p(A) = 0. As in the above argument, we know that 

l*[3] (^) = I $<!>) ° TTi) • (E M (U I $(2)) O 7T 2 ) ■ (E„(U | $(3)) ° 7T 3 ) d/if 3] , 

and so we must actually have fxj^ (-Bi x B 2 x -B3) = where Bi := {E^l^ | > 
0}. Clearly A is contained in B\ C\B 2 HB 3 up to a /i-negligible set, so it will suffice 
to show that this intersection is /i-negligible. 

Now, each of $m = ${1,2} V ${1,3}, $(2) and $/ 3 \ can be generated using inter- 
sections of members from countable generating sets in each Let 

C % i})2 C . . . 
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be an increasing sequence of finite subalgebras of sets that generates up to 
//-negligible sets, and let 

H (n) _ = eTi=T2=T3 v B ^ y ^ ^ 

when {i,j,k} = {1,2,3}. By the martingale convergence theorem we have 
E M (l Bi | -> 1 B . in L 2 (/i) as n -> oo. Now pick 5 e (0, 1/3) and let Cf n) : = 
{E^l^ | 2- n ^) > 1 — 5}, so for large n this set should be a /i-approximation to 
and observe in addition that E^l^n)^ | ) < 5 almost surely. 

It easy to check from Corollary 15 .21 that must be relatively independent from 

7r i ~ 1 (5^ V H^) over 7r i " 1 (5^) under ^ when {i, j, A;} = {1, 2, 3}, and from 
this we compute that 

< 5 / l c (n) " l c (n, • l cW d/if 3] = ^f 3] (C{ n) X X <?<">). 

Therefore 

„g,(Cf > x cf x cf ) < ^(^ x 5 2 x B 8 ) + ^(Cf > x x cf ), 

and so since 5 < 1/3 we must have (C^ x C2 x C3 ) = for all n. 

The importance of this is that for large n we have now approximated the sets B { 
by sets that lie in the simpler cr-algebras but nevertheless still enjoy 
the property that the measure /x^ (C[ n ^ x C^ 1 x C3 ) is strictly zero. Since each 
B{i,j},n is finite, for any given n we may write each C 4 as a finite union of subsets 

of the form C^J = D hP n C^, p n C iifciP with A, P e S Tl=T2=T 3 andC ijiP e 
for every p, and these must now also enjoy the property that 

f^fsi {(Di. pl r}Ci : 2. pl r}Ci : 3, pl )x (i?i ) p 2 nC2 ( i ) p a nC2 ( 3 )Pa ) x (-Da^nCs^pgnCs^pa)) = o 

for all possible indices pi, p 2 > P3- 

Next the fact that ^(n^^AnJ^C)) = whenever C G %j} (Lemma[3]2]) 
comes into play, allowing us for example to move the set C2,i tP2 under the first 
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coordinate rather than the second in the above equation, and similarly. In this way 
we can re-arrange the above equation into the form 

n -^i,P2 n Di jP3 ) n (Ci )2 , pi n c 2 ,i, P2 ) n (Ci i3)P1 n c Si i !P3 )) 

x (c 2 , 3lP2 n c 3 , 2 , P3 ) x x) = o. 

This equation involves the sets D := D 1>P1 R -Di, P2 H -Di, P3 G S Tl=T2=T3 , (7 lj2 ■ = 
Ci,2, Pl n C 2 ,i, P2 G ${1,2}, Ci,3 : = C i,3,pi n ^3,1,^3 e ${i,3} and C 2j3 := C2,3, P2 n 
C3,2, P3 £ ${2,3}- Now, Corollary 15 .2l tells us that the three oblique copies $^ ^ are 
relatively independent over $^ 2 3 , under juL, and so we deduce from the above 
equation that 

•(E,(l Cli3 I E^=^) o ttO • (E,(1 C2 , 3 | E^=^ o vr 2 ) d/,f 3] 

= / Id ■ E,(l Cl , 2 I E^=^3) . £,(10,3 I E^=^) ■ E,(1 C2 , 3 | E^^a) d» 
Jx 

where the first and second line here are equal by Lemma l3^2l since all the functions 
involved are E Tl=T2=Ts -measurable. 

However, this now implies that 

p(Dn n {E,(lcJ S Tl=T2=T3 ) > 0}) = 

and hence that we must also have 

h(Di,p± n -Di, P2 n -Di,p 3 n Ci j2iPi n c 2j i iP2 n C\^, Pl n C3 ; i jP3 n c 2j 3 iP2 n C3 )2iP3 ) = o. 

Taking the union of these equations over triples of indices pi,p%, Pz gives //(C^ fl 
fl Cj^) = for any n, and so since the sets Cf 1 ' approximate Bi as n — > oo 
it follows that /i(-E?i fl £? 2 fl £? 3 ) = 0, as required. < 

We now turn to full induction that generalizes the above argument, broken into a 
number of steps. 

LEMMA 6.3 (Lifting using relative independence). Suppose that all upsets in 
the collection have depth at least k, that all those with depth exactly k 

are principal, and that there are I > 1 of these. Then if property P holds for 
all similar collections having £ — 1 up-sets of depth k, then it holds also for this 
collection. 
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Proof Letl il j 1 = (ei), Xj 2 j 2 = (e 2 ), 2^ = (e^) be an enumeration of 
all the (principal) up-sets of depth k in our collection. We will treat two separate 
cases. 

First suppose that two of the generating sets agree; by re-ordering if necessary 
we may assume that e\ = e 2 . Clearly we can assume that there are no duplicates 
among the coordinate-collections (Xi t j)jL 1 for each i separately, so we must have 
i\ 7^ i 2 . However, if we now suppose that A iy j e l iy j for each i, j are such that 



"&(n(rv«)H> 



then the same equality holds if we simply replace A^^ E (ei) with A' {l ^ := 
A il j 1 fl Ai 2t j 2 and A i2 j 2 with A' i2 j 2 := X. Now this last set can simply be ignored 
to leave an instance of a [if* -negligible product for the same collection of up-sets 
omitting X i2 J2 , and so property P of this reduced collection completes the proof. 



On the other hand, if all the are distinct, we shall simplify the last of the 
principal up-sets Ii e j e by exploiting the relative independence among the as- 
sociated oblique copies of our factors. Assume for notational simplicity that 
= (1, 1); clearly this will not affect the proof. We will reduce to an in- 
stance of property P associated to the collection (2jj) defined by 



T' ■ 



(e e )\{e e } if = (1,1) 



which has one fewer up-set of depth k and so falls under the inductive assumption. 

Indeeed, we know from Corollary 15.21 that under /x^ the set n^ l (Ai ) i) is rel- 
atively independent from all the sets n~ 1 (A i j), ^ (1, 1), over the factor 
7rf 1 ($( e< ,)\{ e j), which is dense inside the relevant oblique copy ®J ee \\{ e A- There- 
fore 

d k 
i=l j=l 

n k\ d kj^ 

= / e m (u m i $ (e ,) Uea ) o n ■ n n ^ 



J'=2 i=2 j=l 



Setting := {E M (l Al l | $< ei >\{ ei }) > 0} G $< e ,)\{ e ,} and := ^ for 
7^ (1, 1), we have that fJ,(Ai i \ A' t x ) = and it follows from the above 
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equality that also /i^ ( Ylf =1 ( f]f=i = 0' so an appeal to property P for the 

reduced collection of up-sets completes the proof. □ 

Remark The first very simple case treated by the above proof is the only step 
in the whole of the present section that is essentially absent from Tao's argu- 
ments in Sections 6 and 7 of [fT3l . Nevertheless, it seems to be essential for the 
correct organization of the present argument, since we need to allow for which 
of our sets are lifted under which coordinate projections in the hypothesis that 

/^(nii(n*iiA;)) = o. < 

LEMMA 6.4 (Lifting under finitary generation). Suppose that all up-sets in the 
collection have depth at least k and that among those of depth k there are 

£ > 1 that are non-principal. Then if property P holds for all similar collections 
having at most i — 1 non-principal up-sets of depth k, then it also holds for this 
collection. 

Proof Let T il ^ 1 , X i2 j 2 , . . . , Ii e j e be the non-principal up-sets of depth k, and 
now in addition let e±, e-i, . . . , e r be all the members of X it j t of size k (so, of 
course, r < (?)). Once again we will assume for simplicity that (ie,je) = (1, 1). 
We break our work into two further steps. 

Step 1 First consider the case of a collection (A^j)^ such that for the set A lfl , 
we can actually find finite subalgebras of sets B s E ${e s } for s = 1,2, ... , r such 
that Ai tt j t G B\ V £> 2 V • • • V B r V $ x n / m \ (so A^x lies in one of our non- 

principal up-sets of depth k, but it fails to lie in an up-set of depth k + 1 only 'up 
to' finitely many additional generating sets). Choose M > max s < r \B S \, so that 
we can certainly express 

M r 

Ai t i = [J (B„ h i n B m<2 n • • • n B mjT n c m ) 

m=l 

with B m s G B s for each s < r and C m G $ T n / w \. Inserting this expression 
into the equation 

"&(n(rK))=° 

i=i i=i 

now gives that each of the M r individual sets 

ki d ki 

((B m ,i n 5 m , 2 n • • • n B miT n c m ) n f] x JJ ( p| 

j=2 ' i=2 i=l 
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is //^-negligible. 

Now consider the family of up-sets comprising the original X iy j if i = 2, 3, . . . , d 
and the collection (ei), (e 2 ), . . . , (e r ), Xi j2 , Xi jfcl corresponding to i = 1. 

We have broken the depth-/c non-principal up-set Xi i into the higher-depth up-set 
2i, l H (>H J and the principal up-sets (e s ), and so there are only t — 1 minimal- 
depth non-principal up-sets in this new family. It is clear that for each m < M r 
the above product set is associated to this family of up-sets, and so an inductive 
appeal to property P for this family tells us that also 

ki d fcj 

p((fl m ,i n B mi2 n • • • n B m>r n c m ) n f] A ld nf)f) = o 

j=2 i=2j=l 

for every m < M r . Since the union of these sets is just DiLi fljii ^t,i» m ^ s gi yes 
the desired negligibility in this case. 

Step 2 Now we return to the general case, which will follow by a suitable lim- 
iting argument applied to the conclusion of Step 1 . Since any ${ e } is countably 
separated, for each e with \e\ = /c we can find an increasing sequence of finite 
subalgebras B e ,i Q £> ej2 Q ■ ■ ■ that generates <£>{ e } up to //-negligible sets. In 
terms of these define approximating sub-cr-algebras 

H S } : = ^nUl) V V B ^ 

eeZwn(M) 

so for each X, j these form an increasing family of a-algebras that generates ^ . 
up to /t-negligible sets (indeed, if X^j does not contain any sets of the minimal 
depth k then we simply have 5^ = $x. j for all n). 

Observe that by Corollary l5.2[ for each n we have that $^ x and V(i j-wn i) ^j -1 i"^) 
are relatively independent over 7rf 

Given now a family of sets (Ajj)^- associated to (Xjj) jj, for each (z, j) the condi- 
tional expectations E^l^ 3 . | sf^ ) form an almost surely uniformly bounded mar- 
tingale converging to l AlJ in X 2 (/i). Letting Br) := {E M (l Aj ^ | S£V) > 1 - 5} 
for some small 5 > (to be specified momentarily), it is clear that we also have 

fi(AijABij) — > as n — > oo. Let also 

d ki 
i=l j=l 
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We now compute using the above-mentioned relative independence that 



= j ( i ^)\a !J °^)-( n V) ^^ 



for each pair («, j) 



However, from the definition of we must have 

almost surely, and therefore the above integral inequality implies that 

^(F\tt- 1 (Aj)) < S [ ottO • ( J] V> O7r = 

From this we can estimate as follows: 

< ^&(n (n < o+(^^)^(f), 

i=i j=i (ij) i=i 

and so provided we chose <5 < ( Ylt=i ki) * we must in fact have /ij^ (F) = 0. 

We have now obtained sets (B^)^ that are associated to the family and 
satisfy the property of lying in finitely-generated extensions of the relevant factors 
corresponding to the members of the 2 it j of minimal size, and so we can apply the 

result of Step 1 to deduce that (flti f&i B tj) = °- 11 follows that 



K fl Pi ^) ^ E MAj' \ 4?) - as n ^ oo, 

i=l j=l i,j 

as required. □ 
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Proof of Proposition 16. II We first take as our base case k { = 1 and T i:1 = {[d]} 
for each i = 1, 2, . . . , d. In this case we know that for any A E the pre-images 
7r^ 1 (y4) are all equal up to negligible sets, and so given A\, A 2 , . . . , A d e we 
have = ^(Ai x A 2 x • • ■ x A d ) = fi(A l n A 2 n ■ ■ ■ n A d ). 

The remainder of the proof now just requires putting the preceding lemmas into or- 
der to form an induction with three layers: if our collection has any non-principal 
up-sets of minimal depth, then Lemma 16.41 allows us to reduce their number at 
the expense only of introducing new principal up-sets of the same depth; and hav- 
ing removed all the non-principal minimal-depth up-sets, Lemma [631 enables us 
to remove also the principal ones until we are left only with up-sets of increased 
minimal depth. This completes the proof. □ 
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